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Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

- If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1 )[x] Responsive to communication(s) filed on 20 August 2004 . 
2a)M This action is FINAL. 2b)D This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) KI Claim(s) 5.6.9-14 and 29-34 is/are pending in the application. 

4a) Of the above claim(s) ^ is/are withdrawn from consideration. 

5) D Claim(s) is/are allowed. 

G)M Claim(s) 5.6.9-14 and 29-34 is/are rejected. 

7) D Claim(s) is/are objected to. 

8) D Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10) D The drawing(s) filed on is/are: a)D accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1.85(a). 
Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1 .121(d). 

11) D The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 

Priority under 35 U.S.C. § 1 1 9 

12) D Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 

a)D All b)D Some * c)D None of: 

1 -D Certified copies of the priority documents have been received. 

2. Q Certified copies of the priority documents have been received in Application No. . 

3. Q Copies of the certified copies of the priority documents have been received in this National Stage 

application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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DETAILED ACTION 

Response to Amendment 

1. This office action is in response to the amendment filed 8/20/2004. Accordingly, claims 
1-4, 7-8, 15-18 and 23-28 are canceled, claims 19-22 are withdrawn and claims 5-6, 9-14, and 
29-34 are pending for examination. Since claims 19-22 have been withdrawn from consideration, 
claims 19-22 should be canceled because claims 19-22 have not been further treated on the 
merits. 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

3. Claims 5-6, 9, 1 1-14 and 29-30 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Terui et al (US PAT. 5,684,527 hereinafter Terui) in view of Maekawa et al. (US PAT. 
5,884,257 hereinafter Maekawa). 

Regarding claim 5, Terui discloses a video conferencing system comprising a conference 
bridge (10, figure 1) for interconnecting a plurality of videoconference stations (12-18, figure 1) 
and a speaker identification subsystem (146, figure 7) to determine whether a conferee is 
speaking based on voice level and amount of motion (col. 3 lines 43-58, col. 5 lines 8-14 and col. 
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8 lines 1-4). In addition Terui also teaches the subsystem to compare level of voices made at 
each point in order to determine a speaker (col. 3 line 59 through col. 4 line 46 and col 7 line 67 
through col. 8 line 4) so that one skill in the art would recognize Terui teaching the subsystem to 
determine which of the conferees is speaking the loudest when multiple conferees are speaking 
simultaneously. Terui differs from the claimed invention in not specifically teaching the 
subsystem for determining whether a conferee is speaking based on whether visual lip 
movements of said conferee are reasonably consistent with an audio signal from a conference 
station in which said conferee is located. However, Maekawa teaches voice recognition 
apparatus for determining whether or not a speaker based on the voice signal and the lip 
movement signal, in order to prevent mis-recognition due to ambient noise (col. 5 line 11 
through col. 7 line 37). Therefore, it would have been obvious to one having ordinary skill in the 
art at the time the invention was made to modify Terui in having means for determining whether 
the conferee is speaking, at least in part, based on whether lip movements of said conferee are 
reasonably consistent with an audio signal from the conference station in which said conferee is 
located, as per teaching of Maekawa, in order to prevent mis-recognition due to ambient noise. 

Regarding claim 6, Terui disclose to compare voice levels at each point (col. 3 lines 61- 
63). Thus, the subsystem obviously comprises a voice activity detector. 

Regarding claim 9, Maekawa teaches to including image analysis and recognition 
software (4, figure 1). 

Regarding claim 11, Terui discloses a videoconference station (12, figure 1) comprising a 
transmitter (26, figure 1) to transmit a combined audio and video signal to a videoconference 
bridge (10, figure 1) and speaker identification subsystem (146, figure 7) located at the 
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videoconference station to determine whether a conferee is speaking based on amount of motion 
(col. 3 lines 43-58, col. 5 lines 8-14 and col. 8 lines 1-4). In addition Terui also teaches the 
subsystem to compare level of voices made at each point in order to determine a speaker (col 3 
line 59 through col. 4 line 46 and col. 7 line 67 through col. 8 line 4) so that one skill in the art 
would recognize Terui teaching the subsystem to determine which of the conferees is speaking 
the loudest when multiple conferees are speaking simultaneously. Terui differs from the claimed 
invention in not specifically teaching the subsystem for determining whether a conferee located 
at the video conference station is speaking by analyzing whether visual lip movements of said 
conferee are substantially consistent with an audio signal from the video conference station in 
which said conferee is located so as to produce human speech. However, Maekawa teaches voice 
recognition apparatus for determining whether or not a speaker based on the voice signal and the 
lip movement signal, in order to prevent mis-recognition due to ambient noise (col. 5 line 1 1 
through col. 7 line 37). Therefore, it would have been obvious to one having ordinary skill in the 
art at the time the invention was made to modify Terui in having means for determining whether 
a conferee located at the video conference station is speaking by analyzing whether visual lip 
movements of said conferee are substantially consistent with an audio signal from the video 
conference station in which said conferee is located so as to produce human speech, as per 
teaching of as per teaching of Maekawa, in order to prevent mis-recognition due to ambient 
noise. 

Regarding claim 12, the limitations of the claim are rejected as the same reasons set forth 
in claim 6. 
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Regarding claims 14, Terui discloses method of displaying images of a plurality of 
conferees (12-18, figure 1) in a videoconference system comprising the steps of determining 
whether a conferee is speaking by analyzing audio signal and amount of motion from a 
conference station in which the conferee is located and visually altering an iinage of at least one 
of a plurality of remotely located conferees when said conferee is determined to be the loudest 
speaker (coL3 line 43 through col col. 4 line 46 and col. 8 lines 1-4). In addition Terui also 
teaches the subsystem to compare level of voices made at each point in order to determine a 
speaker (col 7 line 67 through col. 8 line 4) so that one skill in the art would recognize Terui 
teaching the subsystem to determine which of the conferees is speaking the loudest when 
multiple conferees are speaking simultaneously. Terui differs from the claimed invention in not 
specifically teaching the step of determining whether a conferee located at the video conference 
station is speaking by analyzing whether visual lip movements of said conferee are substantially 
consistent with an audio signal from the video conference station in which, said conferee is 
located so as to produce human speech. However, Maekawa teaches voice recognition apparatus 
for determining whether or not a speaker based on the voice signal and the lip movement signal, 
in order to prevent mis-recognition due to ambient noise (col. 5 line 1 1 through col. 7 line 37). 
Therefore, it would have been obvious to one having ordinary skill in the art at the time the 
invention was made to modify Terui in having the step of determining whether the conferee is 
speaking by analyzing the consistency between lip movements of said conferee and an audio 
signal from a conference station in which said conferee is located such that the combination of 
lip movement and audio signal indicates human speech, as per teaching of Maekawa, in order to 
prevent mis-recognition due to ambient noise. 
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Regarding claim 29, Terui discloses the videoconferencing system further comprising 
means for visually altering an image of said conferee displayed in other conference station if said 
conferee is determined to be the loudest speaker of the plurality of conferees (col. 4 lines 22-46). 

Regarding claim 30, the limitations of the claim are rejected as the same reasons set forth 
in claim 14. 

4. Claim 10 is rejected under 35 U.S.C 103(a) as being unpatentable over Terui et al. (US 
PAT. 5,684,527 hereinafter Terui) in view of Maekawa et al. (US PAT. 5,884,257 hereinafter 
Maekawa) as applied in claim 29 above, and further in view of Ogata et al. (JP 06062400A 
hereinafter Ogata). 
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Regarding claim 10, the combination of Terui and Maekawa differs from the claimed 
invention in not specifically teaching means for visually altering the image comprising means for 
highlighting a border around the image of the conferee determined to be the loudest speaker. 
However, Ogata teaches to display a red rectangular marker in a window display frame to 
indicate who is a speaker in order to easily specify who is a speaker (abstract). Therefore, it 
would have been obvious to a person of ordinary skill in the art at the time the invention was 
made to modify the combination of Terui and Maekawa in having means for highlighting a 
border around the image of the conferee determined to be the loudest speaker, as per teaching of 
Ogata, in order to easily specify who is a speaker. 

5. Claims 31-32 are rejected under 35 U.S.C. 103(a) as being unpatentable over Terui et al. 
(US PAT. 5,684,527 hereinafter Terui). 

Regarding claim 31, Terui discloses a method for identifying which conferee in a video 
conference is a speaking comprising the steps of detecting audio signal associated with each 
conferee, and comparing the first detected audio signal with teach conferee to determine which 
detected audio signal is louder (col. 3 line 43 through col. 4 line 46 and col. 7 line 67 through 
col. 8 line 4). Note Terui teaches to compare level of voices made at each point in order to 
determine a speaker and to alter the image in response to the determination (col. 3 line 59 
through col. 4 line 46 and col. 7 line 67 through col. 8 line 4) so that one skill in the art would 
recognize Terui teaching to provide indication, i.e. vividly display, to each conferee of which 
detected audio signal the loudest. 
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Regarding claim 32, Terui teaches the steps of altering an image of either the first 
conferee or the second conferee based, at least in part, on which detected audio signal is 
determined to be louder (col. 4 lines 30-32). 

6. Claims 33-34 are rejected under 35 U.S.C. 103(a) as being unpatentable over Terui et al. 
(US PAT. 5,684,527 hereinafter Terui) in view of Ogata et al (JP 06062400A hereinafter 
Ogata). 

Regarding claims 33-34, Terui differs from the claimed invention in not specifically 
teaching altering the image comprising means for highlighting a border around the image on 
which received audio signal is determined to be the louder and providing textual information 
indicating which detected audio is determined to be louder. However, Ogata teaches to display a 
red rectangular marker in a window display frame to indicate who is a speaker in order to easily 
specify who is a speaker (abstract). Therefore, it would have been obvious to a person of 
ordinary skill in the art at the time the invention was made to modify Terui in having means for 
highlighting a border around the image on which received audio signal is determined to be the 
louder and providing textual information indicating which detected audio is determined to be 
louder, as per teaching of Ogata, in order to easily specify who is a speaker. 

Response to Arguments 

7. Applicant's arguments with respect to claims 5-6, 9-14 and 30-34 have been considered 
but are moot in view of the new ground(s) of rejection. 
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Conclusion 

8. Applicant's amendment necessitated the new ground(s) of rejection presented in this 
Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). 
Applicant is reminded of the extension of time policy as set forth in 37 CFR 1 .136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the date of this 
final action. 

9. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to George Eng whose telephone number is 703-308-9555. The 
examiner can normally be reached on Tue-Fri 7:30 AM-6:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Curtis A. Kuntz can be reached on 703-305-4708. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 
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Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



George Eng 
Primary Examiner 
Art Unit 2643 




